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EXECUTIVE  OFFICE  OF  THE  PRESIDENT 

NATIONAL  SCIENCE  AND  TECHNOLOGY  COUNCIL 

WASHINGTON,  D.C.  20502 


September  23,  2004 


Dear  Colleague: 

This  report  provides  an  update  on  the  five-year  plan  for  Federal  agencies  dealing  with 
genomics  activities  related  to  domesticated  animals.  This  plan  was  developed  by  the 
NSTC  Committee  on  Science’s  Interagency  Working  Group  (IWG)  on  Domestic  Animal 
Genomics  in  2003.  The  IWG  was  originally  chartered  in  March  2002,  to  raise  awareness 
of  the  importance  of  domesticated  animal  species  and  to  address  and  coordinate  Federal 
programs  in  domestic  animal  genomics.  The  IWG  consulted  both  scientists  and  industry 
representatives  in  developing  the  five-year  “road  map”  plan. 

Significant  milestones  have  been  achieved  in  the  past  few  years  in  genomics  research  on 
humans,  research  model  animals,  plants,  and  microorganisms,  both  domestically  and 
internationally.  The  extension  of  these  efforts  to  include  the  domestic  animal  species 
offers  the  opportunity  to  elucidate  the  mechanisms  underlying  human  disease  via  more 
complete  understanding  of  evolutionary  relationships  between  species  leading  to  the 
development  of  new  therapeutics.  Additionally,  domestic  animal  genomics  offers  the 
opportunity  to  improve  animal  health,  increase  food  quality,  safety  and  nutritional  value, 
improve  human  nutrition,  and  increase  production  efficiency  of  animal  agriculture. 

Judging  from  the  progress  made  in  the  past  year  through  activities  described  in  this 
report,  there  is  every  indication  that  continued  significant  advances  will  be  achieved  in 
the  coming  year,  particularly  in  completing  critical  infrastructure  in  the  areas  of  structural 
genomics  and  bioinformatics.  The  IWG  will  continue  to  coordinate  the  activities 
contributing  to  the  five  year  plan  to  ensure  that  U.S.  efforts  in  domestic  animal  genomics 
benefit  from  interagency  support  and  cooperation,  keeping  U.S.  scientists  at  the  forefront 
of  animal  biology  and  its  application  to  solving  global  problems  in  public  health, 
agriculture,  energy,  and  environmental  protection. 
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Coordination  of  Programs  on  Domestic  Animal 
Genomics:  The  Federal  Framework 

Progress  Report 


I.  Executive  Summary 

The  Interagency  Working  Group  (IWG)  on  Domestic  Animal  Genomicswas 
chartered  in  2002  with  the  charge  of  enhancing  interagency  communication 
and  awarenessof  the  importance  of  agricultural  and  companion  animal 
species,  increasing  leverage  of  investmentsacrossgovemmentagencies,  and 
positioning  of  agriculture  asa  critical  element  of  the  Fed  era  I  genomics 
programs.  The  Committee  on  Science  providesoversight overthe  IWG 
activities.  The  membership  of  the  IWG  consistsof  representatives  from  the 
Department  of  Agriculture  (USDA),  Department  of  Energy  (DOE),  Food  a  nd 
Drug  Administration  (FDA),  National  Institutesof  Flealth  (NIH),  National  Science 
Foundation  (NSF),  Office  of  Sc  ience  a  nd  Tec  hnology  Policy  (OSTP),  Office  of 
Managementand  Budget  (OMB),  and  U.S.  Agency  for  Internationa  I 
Development  (USAID).  The  IWG  subsequently  identified  the  following  broad 
strategic  goals: 

•  Bring  into  place  the  programmatic  elements  needed  to  advance  the  study 
and  understanding  of  domesticated  animal  genomes,  including  large-scale 
DNA  sequencing;  functional  characterization  of  expressed  genes(functional 
genomics);  tools  fordata  storage,  analysisand  visualization  (bio  informa  tics); 
and  study  of  similarities  among  genomes  (compa  rative  genomics). 

•  Leverage  the  national  infrastructure  for  large-scale  DNA  sequencing  that  has 
been  established  forthe  Human  Genome  Projectand  other  vertebrate  and 
model  orga  nism  genomes. 

•  Advance  and  utilize  the  enabling  toolsand  infrastructure  of  functional 
genomicsand  bio  informa  tics  to  enhance  the  understanding  notonlyof 
basic  science  and  disease  mechanisms,  butalso  to  address  critic  a  I 
agricultural  missions,  including  animal  health  and  well-being,  food  safety, 
a  nd  huma  n  nutrition. 

•  Ensure  that  genomicsdata  are  freely  available  in  the  public  domain  and 
genomicsreagentsand  resourcesare  available  to  the  public. 

•  Increase  the  training  opportunitiesforgenomicsand  bioinformatics  at  a  II 
levelsof  education. 

•  Coordinate  and  encourage  international  cooperation  to  achieve  these 
goals 


COORDINATION  OF  PROG  RAMS  ON  DOMESTIC  ANIMAL  GENOMICS;  THE  FEDERAL  FRAMEWORK 


The  (*i*n»mic  Mup  of  I  In*  C  'hlckcn 


\ 


In  September  2003,  the  IWG  released  its  initial  plan  entitled  "Coordination  of 
Programson  Domestic  Animal  Genomics:  A  Federal  Framework”  in  which  large- 
scale  sequenc  ing,  data  management  and  bioinformatics,  and  functional 
genomieswere  identified  asthe  specific  goalsto  be  achieved  in  fiscal  years 
2003  to  2007  (see  http://www.ostp.gov/NSTC/html  Animal_Genome%7EWEB.pdf). 
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In  this  report,  progress  in  the  first  year  of  the  effort  ispresented.  Hig  hlig  hts 

include: 

•  Completion  of  bovine  and  chicken  BAC  mapswith  the  same  nearing 
completion  forswine. 

•  Progresson  producing  integrated  physical  and  genetic  mapsforbovine, 
swine,  and  chicken. 

•  Completion  of  the  first  draft  assembly  of  the  honeybee  and  chicken  genome 
sequences. 

•  Significant  progress  in  completing  the  first  assembly  of  the  dog  genome 
sequence. 

•  Successful  launching  ofthe  bovine  genome  sequencing  project. 

•  Activities  ofthe  International  Swine  Genome  Sequencing  Consortium  in 
securing  support  to  launch  the  porcine  genome  sequencing  project. 

•  Successful  application  of  genomics  tech  no  logy  in  addressing  the  US  BSE 
cattle  situation. 


Also  reported  are  plansforthe  coming  yearforthe  IWG,  including: 

•  Development  of  a  plan  for  bio  informa  tic  sand  functional  genomicstool 
development  in  agricultural  animal  research. 

•  Development  of  a  broad-based  functional  genomiesprogram  in  infectious 
animal  disease. 

•  Expanding  the  structural  genomics  infrastructure  to  include  additional 
species  of  interest. 

•  Consideration  of  genomic  s  technology  fora  nima  I  traceability  systems. 

In  the  next  year  all  agenciesparticipating  in  the  IWG  plan  to  continue  support 
of  domestic  animal  genomicsresearch  based  on  the  five-year  plan.  The  IWG 
and  participating  a  gene  ies  recognize  the  rapid  pace  of  developments  in 
animal  genomiesand  thatthere  will  likely  be  new  opportunities  requiring  IWG 
attention  as  they  become  evident.  The  membersofthe  IWG  look  forward  to 
continuing  to  work  to  effectively  coordinate  and  facilitate  fed  era  I  efforts  in  this 
important  a  tea. 
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II.  Introduction 


Domesticated  animals  have  played  a  key  role  in  human  society  through  their 
use  aslivestock,  medical  research  models,  and  human  companions.  The 
completion  of  the  human  genome  DNA  sequence  in  April  2003  marked  an 
important  milestone  in  scientific  knowledge.  The  research  arising  from  it,  called 
genomics,  providesa  new  opportunity  to  improve  and  understand 
domesticated  animals.  Once  a  genome  sequence  is  in  hand,  scientists  use  it  to 
identify  new  genes,  discoverand  understand  regulatory  elements,  and  study 
individual  genes,  their  functional  productson  a  molecular  level,  and  their 
interactions  with  othergenes.  Additionally,  the  sequence  can  be  used  to  study 
the  evolution  of  genomesthat  will  aid  in  understanding  how  molecular 
mechanismsarose. 


In  the  United  States,  livestockand  companion  animalsare  the  backbone  of 
billion-do  liar  industries.  Genomic  studiesof  domesticated  animals  will  yield  an 
understanding  of  the  geneticsand  expression  of  genes  important  to  improving 
these  spec iesforgreaterfood  yieldsand  improved  animal  health  and  human 
nutrition.  Even  more  importantly,  comparative  genomicswill  make  significant 
contributions  to  biomedical  research  and  ultimately  improved  human  health. 
Understanding  human  biology  based  on  insights  from  a  diversity  of  species  will 
lead  to  adva  nces  in  biomedical  resea  rc  h  and  it  is  antic  ipated  that  it  will 
accelerate  the  development  of  new  human  and  animal  pharmaceuticals. 


The  inclusion  of  domesticated  animals  in  a  Federal  genomicsprogram  serves 
two  primary  purposes:  1)  to  increase  the  diversity  of  genomic  sequence 
available  forstudiesthat  by  comparison  with  the  human  genome  DNA 
sequence  contribute  to  finding  new  genesand  studying  evolution  atthe 
genome  level;  a nd  2)  to  generate  genetic  data  critical  to  effic ient  a nd 
sustainable  animal  agricultural  production. 

Genome  sequencing  projectsare  completed  or  underway  forsevera  I  animals 
(including  the  laboratory  mouse,  rat,  rhesus  macaque)  thatserve  asmodelsfor 
studying  human  disease.  Projectsforsequencing  the  genomes  of  several 
domesticated  animals  (chic  ken,  cattle,  honeybee  and  dog)  have  now  entered 
the  genomic  sequencing  pipeline. 
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III.  Background 


The  charge  to  the  Interagency  Working  Group  on  Domestic  Animal  Genomics 
(IWG),  which  wasestablished  in  2002,  wasto  enhance  interagency 
communication  and  a  wareness  of  the  importance  of  agricultural  and 
companion  animal  species,  to  increase  leverage  of  investments  a  c  ross 
government  agenc  ies,  and  to  position  agriculture  asa  critical  element  of  the 
Fed  era  I  genomics  programs.  The  National  Science  a  nd  Technology  Council's 
Committee  on  Science  provides  oversight  overthe  IWG  activities.  The 
membership  of  the  IWG  consistsof  representativesfrom  the  Department  of 
Agriculture  (USDA),  Department  of  Energy  (DOE),  Food  and  Drug  Administration 
(FDA),  National  Institutesof  Flealth  (NIH),  National  Science  Foundation  (NSF), 
Office  of  Science  a  nd  Technology  Policy  (OSTP),  Office  of  Ma  nagement  a  nd 
Budget  (OMB),  and  U.S.  Agency  for  International  Development  (USAID). 

Upon  convening,  the  initial  deliberations  regarding  the  tasksetforth  to  the  IWG 
led  to  an  overarching  statement  of  purpose: 

The  mission  of  the  Interagency  Wbrking  Group  for  Domestic 
Animal  Genomics  is  to  enhance  communication  and 
awareness  of  livestock  and  companion  animal  species  of 
importance  to  the  food  and  agriculture  system;  leverage 
Federal  investments  in  large-scale  genome  sequencing  and 
genome  analysis  across  government  agenc  ies;  position  the 
food  and  agriculture  system  asa  critical  element  of  the 
national  genomics  program;  enhance  dialogue  and 
cooperation  among  Federal  agencies,  universities,  and 
industry  in  the  nation;  and  promote  international  cooperation 
on  domestic  animal  genomics  research. 

The  IWG  subsequently  identified  the  following  broad  strategic  goals: 

•  Bring  into  place  the  programmatic  elements  needed  to  advance  the  study 
and  understanding  of  domesticated  animal  genomes,  including  large-scale 
DNA  sequencing;  functional  characterization  of  expressed  genes(functional 
genomics);  tools  fordata  storage,  analysisand  visualization  (bio  informa  tics); 
and  study  of  similarities  among  genomes  of  different  spec  ies  (comparative 
genomics). 

•  Leverage  the  national  infrastructure  for  la  rge-sca  le  DNA  sequencing  that  has 
been  established  forthe  Human  Genome  Projectand  other  vertebrate  and 
model  orga  nism  genomes. 
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•  Advance  and  utilize  the  enabling  toolsand  infra  structure  of  functional 
genomicsand  bio  informa  tics  to  enhance  the  understanding  notonlyof 
basic  science  and  disease  mechanisms,  butalso  to  address  critic  a  I 
agricultural  missions,  including  animal  health  and  well-being,  food  safety, 
a  nd  huma  n  nutrition. 

•  Ensure  that  genomicsdata  are  freely  available  in  the  public  domain  and 
genomicsreagentsand  resourcesare  available  to  the  public. 

•  Increase  the  training  opportunitiesforgenomicsand  bioinformatics  at  all 
levelsof  education. 

•  Coordinate  and  encourage  international  cooperation  to  achieve  these 
goals 


The  IWG  determined  that  large-scale  sequencing,  data  managementand 
bio  informa  tics,  and  functional  genomicsare  the  specific  goalsto  be  achieved 
in  fisc  a  I  years  2003  to  2007,  inc  lud  ing : 


•  Large-scale  sequencing  to  produce  draftgenome  sequences(8-fold 
sequence  coverage)  of  honeybee,  chicken,  dog,  cattle,  swine,  and  cat. 

•  Data  managementand  bio  informa  tics  to  specifically  supportagriculturally 
important  species.  This  will  support  significa  nt  improvements  in  data 
managementand  ana  lysis  software,  allow  greaterdata  accessibility  and 
secure  long-term  maintenance,  increase  capabilities  to  deal  with  rapidly 
accumulating  data  complexity  as  data  bases  include  functional  information, 
and  provide  more  powerful  tools  to  mine  large  genomes(individually  and 
comparatively). 

•  Functional  genomicsto  specifically  investigate  agriculturally  important 
species.  An  increase  in  data  forlivestockgenomesrequiresconcomitant 
investment  in  functional  genomicsto  support  genome  annotation,  the  study 
of  gene  regulation  and  expression,  and  speciesevolutionary  relationships. 
Researchers  will  have  access  to  individual  genes,  which  can  be  cloned  and 
characterized,  the  ability  to  scan  entire  genomes  for  specific  functions,  and 
be  able  to  gain  new  insights  from  compa  risonsto  othergenomes. 
Quantitative  trait  loci  will  be  accessible  on  a  gene  level,  leading  to  studies 
thatwill  elucidate  characteristics  such  asfood  productivity  and  disease 
resistance  and  will  lead  to  the  ability  to  manipulate  those  traits  more  quickly 
and  effectively. 
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IV.  Progress  Reported  in  the  Past  Year 

Focus  on  Elucidating  Genome  Structure  and  Organization 


The  progress  reported  herein  buildsupon  an  impressive  foundation  of 
agricultural  and  bio  logic  a  I  sc  ience.  Research  efforts,  primarily  over  the  past 
century,  have  provided  a  vast  amount  of  phenotypic  and  genotypic 
information  fora  wide  a  rray  of  c  ha  ractere  in  livestock  and  companion  a  nimals. 
Long-term  large  population  studieshave  been  employed,  particularly  in  the 
area  of  livestock  production  sc  ience,  to  develop  tec  hnology  to  allow  genetic 
evaluation  and  subsequent  selection  to  be  practiced  on  breeding  populations 
in  highly  sophisticated  planned  breeding  programs,  especially  fordairy  and 
beef  cattle,  layerand  broilerchickens,  and  swine.  AsDNA-based  technologies 
were  developed  in  the  1970sand  1980s,  these  well-characterized  populations 
were  invaluable  in  providing  opportunitiesto  develop  genetic  linkage  mapsto 
begin  the  search  forgenesaffecting  important  traits.  Highly  saturated  genetic 
mapswere  developed  forall  of  the  majordomesticated  animal  speciesduring 
the  1990s,  allowing  Quantitative  Trait  Loci  (QTL)  to  be  identified  across  many 
speciesand  forma  ny  measures  of  performance.  In  parallel,  genetic  studiesof 
the  human  were  progressing  with  the  development  of  genetic  mapsthat  were 
used  to  identify  genes  important  in  human  disease.  It  wasrecognized  in  the 
late  1980s  that  progress  in  developing  too  Is  for  agriculture  I  or  biomedical 
application  could  be  greatly  accelerated  by  producing  high -re solution  physical 
mapsand  ultimately  the  genome  sequence  of  human  and  agricultural  species. 
The  genome  sequence  is  needed  in  order  to  find  all  of  the  genes  in  a  n 
organism  and  begin  to  unravel  how  genesfunction  and  interact  (i.e.  functional 
genomics).  Knowledge  of  gene  interactions  leads  to  further  research  in 
understanding  proteinsand  their  interactions  and  networks/pathways  within  the 
cell. 


The  challenge  presented  by  the  ambitiousgoalssetforthe  human  genome 
project  stimulated  rapid  improvements  in  DNA  sequencing  strategiesand 
technologiesin  the  pastdecade.  Following  the  successful  completion  of  the 
human  genome  sequence  and  substantial  progress  in  producing  the  sequence 
of  the  majormodel  speciesused  in  biomedical  research,  there  continued  to  be 
a  need  forsequencing  of  additional  mammalian  genomes,  including 
domesticated  animals,  to  be  used  asa  tool  to  find  all  of  the  genesand 
regulatory  elementsencoded  in  the  human  genome.  Tbe  approach  used, 
called  comparative  genomics,  comparesand  searchesforsimilaritiesin  DNA 
sequencesfrom  distantly  related  organisms.  Less  than  half  of  these  regions  of 
sequence  similarity  contain  genes  known  to  be  highly  conserved  throughout 
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the  evolutiona  ry  tree,  a  nd  it  is  postulated  thatthe  remaining  regions  may 
encode  important  regulatory  elements.  In  the  past  two  years,  the  National 
Human  Genome  Research  Institute  (NHGRI)  of  the  National  Institutesof  Health 
included  in  its  sequencing  program  the  sequencing  of  genomesfrom  species 
whose  sequence  wasexpected  to  help  find  all  of  the  genesand  identify 
regulatory  elements  in  the  human,  mouse,  and  rat  genomesthrough 
comparative  sequencing  and  evolutionary  biology  approaches. 

During  2002,  proposa  Is  were  submitted  to  NHGRI  to  develop  draft-quality 
genome  sequencesforthe  chicken,  cow,  honeybee,  pig,  dog,  and  cat.  All 
were  designated  to  be  "high  priority"  forsequencing  by  NHG Rl  supported 
sequencing  centers(with  the  understanding  thatclose  evolutionary 
relationships  between  dog/catand  cow/pig  would  only  allow  justification  for 
one  of  each  pair  of  species  to  be  sequenced  in  the  NHGRI  program)  based 
upon  the  value  of  their  sequence  data  to  better  understanding  of  the  human 
genome  thereby  improving  huma  n  hea  Ith.  Designation  of  high  priority  does  not 
automatically  mean  that  a  project  will  be  undertaken  by  the  NHG  Rl 
sequencing  program  and  in  several  cases,  contributions  from  partnershaving  a 
significa  nt  interest  in  obtaining  the  genomic  sequence  of  particular  org a  nisms 
have  been  required.  A  primary  emphasisof  the  IWG  since  its  inception  has 
been  how  to  bring  togetherthe  resourcesand  partnerships  to  allow  these  large- 
scale  sequencing  projects  to  come  to  fruition  fordomestic  animal  species. 


The  following  summaries  highlight  the  majoraccomplishmentsduring  2003 
along  the  pathwayto  completion  ofthe  genomics  infra  structure  needed  to 
advance  the  genomic  studiesofthe  chicken,  cow,  dog,  honeybee,  pig,  and 
cat.  The  progress  to  date  hasoccurred  asa  two-step  process  The  first  step  is 
the  completion  of  infrastructure  organizing  the  genetic  (linkage)  and  physical 
(bacterial  artificial  chromosome  (BAC)  and  radiation  hybrid  (RH)}mapsfor 
each  ofthe  important  spec  ies  The  resulting  integrated  genome  mapsthen 
become  the  foundational  "scaffolding"  upon  which  whole  genome 
sequencing  is  performed— resulting  in  the  ultimate  full  sequence  map  ofthe 
genome. 

Genetic  and  Physical  Genome  Maps 

International  Consortia  Develop  Bovine  and  Porcine  BAC  Maps 

The  availability  of  physical  maps,  and  eventually  the  DNA  sequence  ofthe 
bovine  and  porcine  genomes,  will  transform  genetic  research  in  these  species. 
Such  mapswill  greatly  facilitate  the  identification  of  genesaffecting  production 
traits,  disease  susceptibility,  animal  health  and  product  quality  and  nutritional 
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value.  The  elucidation  of  these  genomes  will  also  benefit  human  medicine  by 
identifying  novel  genes  and  their  function  that  are  critic  a  I  for  huma  n  hea  Ith.  It  is 
important  that  the  genetic  and  physical  maps,  and  the  reagents  used  to  build 
them,  are  freely  accessible  and  in  the  public  domain.  The  bovine  and  swine 
genomics  research  communities  need  additional  laboratory  resourcesto  allow 
more  effic  ient  use  of  current  huma  n  and  mouse  mapping  information  to  solve 
production  inefficiency  and  health  problemsand  contribute  to  increasing  the 
global  competitiveness  of  these  livestock  industries. 

Bovine  BAC  Map 

An  international  effort  (International  Bovine  BAC  Mapping  Consortium;  IBBMC) 
wasinitiated  to  develop  a  physical  BAC-based  map  of  the  bovine  genome. 
Clonesfrom  three  publicly  available  bovine  BAC  libraries  were  fingerprinted 
and  end-sequenced  to  develop  the  bovine  BAC  map.  The  three  BAC  libraries 
were  generated  from  three  common  breeds,  Holstein,  Hereford,  and  Angus.  A 
total  of  295,000  clones  have  been  fingerprinted  (15.8  Xcoverage)  from  these 
three  libraries.  Additional  fingerprinting  (100,000  c  lones)  wasperformed  on  a 
fourth  BAC  library  by  INRA  (LTnstitut  National  de  la  Recherche  Agronomique), 
France  and  isbeing  end-sequenced  (12,000  clones  to  date).  Overall,  a  total  of 
160,000  c  lones  have  been  sequenced  with  140,000  cloneshaving  both  ends 
sequenced.  End  sequencing  information  from  an  additional  100,000  clonesby 
the  IN  FRA  and  international  effort  will  be  added  and  integrated  in  the  near 
future.  All  sequence  data  are  publicly  available  (www.ncbi.nih.gov/Genbank). 


The  construction  of  the  integrated  BAC  map  is  in  progress.  The  initial  contig 
assembly  generated  14,000  contigs  with  2  or  more  c  lones  per  contig  and  30,000 
singletons.  By  utilizing  the  information  in  the  human  map  and  sequence,  the 
contigswere  merged  reducing  the  numberto  2,622  with  621  of  these  contigs 
containing  199,326  clonesthatare  anchored  to  human  chromosomes.  A  2,000 
marker  radiation  hybrid  (RH)  map  hasbeen  generated  that  will  be  used  to 
furthermerge  contigsand  a nc horcontigsto  the  human  and  bovine  map. 
Publication  of  the  bovine  BAC  map  is  planned  for  late  2004. 


The  BAC  map  wasused  asone  of  the  justifications  to  sequence  the  bovine 
genome  since  this  would  be  the  first  genome  to  be  sequenced  with  a  BAC  map 
nearly  completed  priorto  the  start  of  sequencing.  The  sequencing  of  the 
bovine  genome  hasnow  been  funded  and  an  element  of  the  sequencing 
strategy  will  be  to  perform  "light"  sequencing  of  BAC  clones  a  cross  the 
genome.  19,000  clones  have  been  selected  from  the  Hereford  library  based 
upon  the  BAC  map  forthis purpose.  Additionally,  selected  BACsfrom  regionsof 
high  research  interest  will  be  chosen  for  high-quality  sequencing. 
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The  participants  of  the  IBBMC  include  USDA/ARS  (Beltsville,  Maryland  and  Clay 
Center,  Nebraska),  TexasA&M  University,  University  of  Illinois,  The  Institute  for 
Genomic  Research,  Children'sHospital  Oakland  Research  Institute,  The 
University  of  Alberta  and  Alberta  cattlemen  (Alberta  Uvestock  Genomics 
Initiative),  INRA  (France),  AgResearch  (New  Zealand),  CSIRO  (Australia), 
EMBRAPA  (Brazil),  Roslin  Institute  (Scotland),  British  Columbia  CancerAgency 
Genome  Sciences  Centre  (Canada),  and  the  Alliance  for  Animal  Genomics 
Research. 

Swine  BAC  Map 

The  swine  genomics  research  community  organized  a  similar  and  pa  ra  lie  I  effort 
to  that  employed  by  the  international  bovine  consortium  to  produce  a  porcine 
BAC  map.  The  participants  in  the  International  Porcine  BAC  Mapping 
Consortium  (IPBMC)  include  the  University  of  Illinois,  USDA/ARS  U.S.  Meat  Animal 
Research  Center  Roslin  Intitute,  Sanger  Institute  (U.K.),  INRA,  and  the  Korean 
Uvestock  Institute. 


Clonesfrom  five  available  BAC  libraries  a  re  being  fingerprinted  and  end- 
sequenced  in  the  swine  effort  (generated  from  Duroc,  Korean,  Meishan  and 
Large  White  breeds).  Thusfar  a  total  of  300,300  cloneshave  been  fingerprinted 
with  225,000  c  lones  having  both  ends  sequenced.  Additional  c  tones  a  re 
scheduled  forend-sequenc ing  in  the  spring  of  2004.  Similartothe  model  used 
by  the  bovine  BAC-map  consortium,  the  porcine  map  will  be  anchored  to 
human  chromosomes  using  a  RH  map  currently  being  produced  by  integration 
of  the  12,000-rad  INRA  and  7,000-rad  IN  RA-University  of  Minnesota  RH  panels. 

All  sequence  data  from  thiseffort  are  being  deposited  in  GenBank 
(www.ncbi.nih.gov/Genbank). 

Construction  ofthe  integrated  maps  is  to  be  conducted  atthe  Sanger  Institute 
during  the  second  quarter  of  2004.  The  IPBMC  will  meetatthe  Sanger  Institute 
in J  une  2004 to  review  the  initial  map  and  to  finalize  plans forpublication  ofthe 
integrated  BAC  map.  The  first  public  presentation  of  the  integrated  map  is 
slated  for  the  Internationa  I  Society  of  Animal  Genetics  meetings  in  Tokyo  in 
September  2004. 

The  plansof  the  IPBMC  also  include  contribution  of  a  minimum  tiling  path  of 
clonesfrom  the  Duroc  library  to  be  eventually  used  in  developing  the  pore ine 
whole  genome  sequence  assembly  (see  subsequent  a  rtic  le  on  page  20  of  this 
report). 
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Integrated  Maps  Nearing  Completion  for  Cattle,  Pig,  and  Chicken 

Integration  of  genetic  linkage  maps  with  physical  mapsand  sequences  isan 
essential  component  of  efforts  to  link  traits  of  agricultural  interest  (productivity, 
disease  resistance,  etc.)  to  the  genesencoding  a llelesthat  determine  the 
phenotype.  This  linkage  requiresgenesormarkersthat  are  both  polymorphic 
(capable  of  linkage  mapping  in  populations)  and  easily  assignable  to  physical 
maps.  While  there  isgrowing  interest  in  single  nucleotide  polymorphisms (SNPs), 
simple  tandem  repeats  (STRs,  also  called  "micro  sate  Hites")  have  been  most 
heavily  employed  in  domestic  animals,  due  to  theirwidely  polymorphic  nature. 

Statistical  approachesto  build  integrated  ma ps for  livestock  speciescan 
consolidate  information  from  independent  mapsand  capitalize  on 
complementary  resolution  of  physical  and  genetic  data  sets  to  resolve  marker 
order.  Genetic  data  used  forlinkage  mapping  can  position  markers  separated 
by  recombination  along  a  chromosome,  but  lacks  the  resolution  necessary  to 
reliably  orderclose  markers.  Physical  mapping  data  sets,  such  asradiation 
hybrid  (RH)  pa  nels,  can  reliably  ord  ere  lose  markers,  but  lack  long-ra  nge 
resolution  to  ordergroupsof  widely  separated  markers.  Simultaneously  solved 
maps  representing  both  linkage  and  RH  data  take  advantage  of  each  to  order 
markers  more  reliably. 


The  first  likelihood-based  whole-genome  bovine  map  integrating  linkage  and 
RHdata  wascompleted  by  USDA/ARS's  Roman  L.  Hruska  U.S.  Meat  Animal 
Research  Center(MARC)  at  C lay  Center,  Nebraska,  and  is  now  available  to  the 
Internationa  I  Bovine  BAC  Ma  p  Consortium.  The  map  combines  data  from  two 
radiation  hybrid  panels,  the  3000-rad  European  Union  (COMRAD)  panel  and  the 
5000-rad  Texas  A&M/University  of  Illinois  panel,  with  linkage  data  from  the  MARC 
reference  families.  Over6,300  distinct  markersare  represented,  with  1,381  of 
these  represented  in  the  linkage  and  at  least  one  RH  data  set.  Alignment 
between  2,061  markersequencesand  human  genomic  sequence  providesthe 
basisfora  bovine-human  comparative  map.  Overlap  between  markerand 
BAC  end  sequence  alignmentson  the  human  genome  connects  1,504  markers 
to  the  bovine  BAC  map,  augmenting  574  markers  physically  assigned  to  BAC 
clones.  Besides  providing  a  bovine  orderto  anchor  BAC  contig  assembly  and 
ultimately  guide  bovine  sequence  assembly,  the  integrated  map  hashad  a 
significant  positive  impact  on  the  efficiency  of  the  SNP  discovery  process  to 
develop  new  markers  targeted  to  chromosomal  regions  known  to  harbor  loci 
influencing  economically  important  traits.  Furtherdevelopment  of  the 
integrated  bovine  map  will  incorporate  BAC  data,  providing  a nother  level  of 
resolution  to  refine  markerorder. 
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The  swine  whole-genome  integrated  map  hasbeen  constructed  in 
collaboration  with  Fra  nee's  INRA.  More  than  l.OOOgenetic  markers  from  MARC 
were  included  in  the  analyseswith  over  3,000  markers  ran  across  the  INRA- 
University  of  Minnesota  pig  RH  7,000-rad  panel.  Approximately  750  markers  were 
inc  luded  in  both  data  sets.  The  genetic  map  developed  at  MARC  has  been 
dramatically  expanded  recently  and  now  contains  over  3,000  typed  loci.  One 
thousand  of  the  new  markersare  single  nucleotide  polymorphismsdeveloped 
from  EST sequence  data  generated  in  the  pig.  These  markersare  crucial  to 
developing  a  high-resolution  comparative  map  and  permit  integration  of  the 
genetic,  RH  and  BAC  maps  in  the  nearfuture. 


Genetic  linkage  mapsforthe  chicken  expanded  rapidly  in  the  1990swith  the 
application  of  DNA-based  marker  systems.  The  consensus  linkage  map  (Schmid 
et  at,  Cytogenet.  Cell  Genet.  90,  2000)  covers  a  bout  4000  c  M,  including  over 
2000  markers.  Early  efforts  to  generate  a  physical  map  of  the  chicken  genome 
led  to  the  development  of  several  BAC  libraries(Crooijmansetal.,  Mammal 
Genome,  11,  2000;  Lee  et  al„  Anim.  Gent.  34,  2003).  These  now  have  been  used 
to  generate  genome-wide  BAC  contig  mapsthat  place  chicken  genesand 
markers  in  specific  molecular  locations  (Ren  etal.,  Genome  Res.,  13,  2003; 

Wa  rren  et  at,  manuscript  in  prepa  ration).  A  first  generation  BAC  map,  partially 
integrated  to  the  linkage  map,  isnow  available  (Ren  etal.,  Genome  Research 
13:2754)  a  nd  a  muc  h-improved  version  will  be  published  soon  by  the 
Washington  University  Genome  Sequencing  Center.  This  map  will  be  integrated 
both  to  the  linkage  map  and  to  the  full  genome  draft  sequence  using  the  same 
approachesaspreviously  described  forcattle  and  swine. 

G  iven  the  draft  level  of  the  genome  sequences  now  assembled  for  the 
chicken,  in  progress forthe  cow  and  proposed  forthe  pig,  map  integration 
effortswill  be  important  tools  in  assessing  the  quality  of  the  sequence 
assemblies.  The  genome  sequences  will  also  provide  marker  sets  (both  SNP  and 
STR)  thatwill  permit  much  higher  resolution  in  future  QTL  mapping  efforts  forthe 
purposes  of  mechanistic  ana  lyses  and  marker-assisted  selection. 

Domestic  Animal  EST  Sequences  in  the  Public  Database  Growing 

The  number  of  ESTs  (Expressed  Sequence  Tags)  in  the  dbESTdatabase  from 
domestic  animals(http://www. ncbi.nlm.nih.gov/dbEST/)  continuesto  increase 
ata  rapid  rate.  As  of  February  2004,  1,530,000  ESTs  had  been  deposited  with 
the  largest  numbercoming  from  the  chicken,  cattle,  and  swine  research 
communities. 
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Researchers  are  using  the  EST sequence  information  in  dbESTin  an  increasing 
number  of  ways,  particularly  as  mapping  tools  and  for  the  development  of 
gene  expression  profiling  tools.  The  chicken  isan  excellent  example  of  progress 
in  thisarea.  It  isestimated  thata  total  of  over600,000  chicken  EST sequences 
have  been  determined  (Burtand  Pourquie,  Science,  300,  2003)  with  more  on 
the  way.  Chicken  homologuesto  most  mammalian  genescan  now  be  found  in 
silico  by  searching  the  chicken  ESTand/orthe  no n-redund a  nt  sequence 
databases(e.g.,  http://www.ncbi.nlm.nih.gov/blast).  Brown  etal.  (Nature  Rev. 
Genet.  4,  2003)  reported  thatthe  chicken  EST set  already  available  showed 
strong  matchesto  over90%of  the  human  disease  genes  in  the  "morbid  map" 
of  the  Online  Mendelian  Inheritance  in  Man  database  (http://www.ncbi.nlm. 
nih.gov/Omim).  Chicken  EST contigs provide  the  basis forthe  construction  of 
microarrays,  used  in  global  profiling  of  gene  expression.  Chicken  gene 
microa  rrays  ha  ve  been  used  to  analyze  changes  in  mRNA  levels  in  response  to 
infection  with  Marek'sDisease  Virus(Liu  etal.,  Animal  Genet.  32,  2001)  and 
Eimeria  (Min  etal.,  Appl.  Microbiol.  Biotec hnol.  62,  2003)  and  in  neoplasias 
induced  by  c-myc  and  c-myb  over-expression  (Neiman  etal.,  Proc.  Natl.  Acad. 
Sc  i.  U.S.A.,  98,  2001,  2003).  A  13,000  transcript  chicken  mic  roa  rray  (g  las  slide)  is 
available  from  the  Fred  Hutchinson  Cancer  Research  Center  Genomics  Facility, 
and  more  complete  a  rrays  using  a  variety  of  platforms  a  re  being  pla  nned. 
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Number  of  EST  Entries  in  GenBank  from  Domestic  Animals 
(data  from  dbEST/GenBank/NCBI  as  of  April  2004) 
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Genome  Sequencing 


Initial  Draft  of  the  Domestic  Honeybee  Genome  Assembled 

The  honeybee  (Apismeliferia)  genome  is  the  latest  insect  species,  afterthe 
mosquito  and  flies,  to  be  fully  sequenced  and  made  available  to  the  public. 
Sequencesfrom  the  honeybee  will  inform  the  medical  sciencesby  improving 
research  based  on  the  Drosophila  insect  model,  as  well  as  providing  information 
about  behavior,  development,  and  human  immune  reaction.  The  draft 
honeybee  genome  sequence  isexpected  to  provide  insights  into  the  impressive 
learning  ability  of  these  insectsand  mayprovide  insights  into  learning  and 
neurological  diseases  of  humans.  Additionally,  human  immune  response  to 
honeybee  venom  isan  important  medical  problem  in  the  United  States.  The 
honeybee  sequence  may  also  provide  insights  into  important  medical  questions 
related  to  aging  and  reproduction.  The  honeybee  genome  sequence  will  also 
be  used  to  address  critical  problems  related  to  the  agricultural  importance  of 
this  species,  ranging  from  disease  resistance  to  nutrition,  reproduction,  and 
behavior.  Finally,  asa  more  distant  relative  of  the  three  fly  specieswhose 
genomesare  available,  honeybees  provide  a  new  reference  point  for 
comparative  genomicsacrossthe  insects,  a  group  of  worldwide  importance  for 
both  human  and  ecosystem  health.  The  request  to  NHGRI  for  sequencing  of 
the  honeybee  can  be  found  at  http://www.genome.gov/10002154. 


The  honeybee  genome  sequence  was  produced  from  J  a  nua  ry  to  August  of 
2003,  atthe  BaylorCollege  of  Medic ine'sHuman  Genome  Sequencing  Center 
and  includes  both  short-insert  or  shotgun  clones(2  million  sequence  reads)  and 
end  sequencesfrom  BAC  c  lones  (1.3  million  sequence  reads).  The  shotgun 
sequence  wascompiled  into  an  initial  assembly  (labeled  Release  1.0)  on 
Decemberll,  2003.  This  assembly  was  improved  and  released  asversion  1.1  in 
J  anuary  2004and  is  now  being  analyzed  by  the  insect  genomicscommunity. 
Assembly  1.1  includes  11,600  scaffoldscovering  a  genome  of  206  Mbp,  with  an 
N50  length  of  223  kb.  Forty  percent  of  the  assembly  is  covered  by  a  nc  ho  red 
and  oriented  scaffolds(n  =197).  94%ofthe  1,050  known  honeybee  markersare 
present  in  the  assembly  along  with  whole  orpartial  sequencesfor95%of  genes 
described  previously.  Automated  gene  predictionsusing  thisand  future 
assemblies  will  provide  lists  of  known  and  de  novo  genesfor  honeybees.  These 
predictions  will  be  followed  by  effortsat  Baylorand  other  institutions  to  manually 
annotate  genesof  interest  to  the  medical  and  agricultural  communities. 


Assembly  1.1  used  existing  markers  from  known  honeybee  chromosomes  to 
establish  a  physical  map  for  bees,  a  product  that  will  prove  useful  both  for 
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placing  genesof  interestand  foragricultural  goals  such  as  marker-assisted 
breeding.  This  map  will  evaluate  the  need  to  fill  sequence  gapsin  critical 
areas,  and  will  help  predict  the  feasibility  of  marker  or  locus-specific  breeding 
tools  for  honeybees.  Future  assemblies  will  be  improved  by  the  addition  of  BAC 
end  sequencesand  othersequencesgenerated  to  improve  the  sequence 
depth  in  targeted  regions. 

Funding  forthisprojectcame  from  NHG Rl  ($6.9M)  and  USDA  ($750K). 


(Viewable  on  the  web  at:  http://hgsc.bcm.tmc.edu/projects/honeybee/) 


Chicken  Genome  Drafted  at  Washington  University 

The  chicken  is  the  firstavian  species  for  which  whole  genome  sequence  data  is 
available  in  the  public  domain.  When  first  proposed  to  NHGRI  in  2002  (see 
original  white  pa  per  proposa  I  at:  http  ://genome.wustl.edu/projects,c  hie  ken/ 
Chicken_Genome.pdf),  the  chicken  was  identified  asan  extremely  valuable 
speciesforgenome  sequencing  due  to:  1)  itspremierstatusasa  primary  model 
organism  forstudiesin  embryology  a  nd  development,  virusesand  cancer, 
immunology,  and  gene  regulation;  2)  novel  aspectsof  itsbiology  relevant  to 
human  biology  such  asthe  7J\N  sex  determination  system  and  high 
microchromosome  content;  3)  its  intermediate  evolutiona  ry  position  between 
the  mouse  and  fugu  genomes;  4)  its  potential  for  providing  a  better 
understanding  of  the  evolution  of  gene  orderand  arrangement;  5)  the  wealth 
of  genomicsresourcesavailable  from  the  chicken  research  community;  and  6) 
itsclearand  growing  importance  asa  source  of  animal  protein  in  world 
agricultural  production  and  the  food  system.  Recent  outbreaks  of  avian  flu 
have  accelerated  scientists'  interest  in  learning  more  about  the  chicken 
genome  and  how  genetic  variation  may  play  a  role  in  the  susceptibility  of 
different  strains  to  the  disease. 


On  March  1,  2004,  NHGRI  announced  thata  team  led  by  Richard  Wilson,  Ph.D., 
from  the  Washington  University  School  of  Medicine  in  St.  Louis  successfully 
assembled  the  genome  of  the  Red  J  ungle  Fowl,  Gallusgallus,  which  is  the 
ancestorof  domestic  chickens.  Sequencing  of  the  chicken  genome  began  in 
March  2003.  NHG Rl  provided  about  $13  million  in  funding  forthe  project. 

The  Washington  University  researchers  have  deposited  the  initial  assembly, 
which  isbased  on  seven-fold  sequence  coverage  of  the  chicken  genome 
(approximately  1  billion  bases),  into  GenBa nk (www.ncbi.nih.gov/Genbank).  In 
turn,  GenBa  nk  will  distribute  the  sequence  data  to  the  European  Molecular 
Biology  Laboratory's  Nuc leotide  Sequence  Database,  EMBL-Bank 
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(www.ebi.ac.uk/embl/index.html),  and  the  DNA  Data  BankofJ  apan,  DDBJ 
(www.ddbj.nig.ac.jp). 

To  facilitate  comparative  genomic  analysis,  the  researchers  a  Iso  have  aligned 
the  draft  version  of  the  chicken  sequence  with  the  human  sequence.  Those 
alignments  can  be  scanned  using  the  University  of  California,  Santa  Cruz's 
Genome  Browser,  (http://genome.ucsc.edu/cgi-bin/hgGateway);  the  National 
Centerfor  Biotec  hnology  Information's  Map  Viewer  (www.ncbi.nlm.nih.gov/ 
mapview);and  the  European  Bio  informa  tics  Institute's  Ensembl  system, 
(www.ensembl.org). 

In  addition,  using  the  Gallusgallus genome  sequence  assembled  by 
Washington  University  asa  reference  framework,  an  international  team,  led  by 
the  Beijing  Genomics  Institute  in  China  and  supported  by  the  Wellcome  Trust  in 
Great  Britain,  hascreated  a  map  of  genetic  variation  forthree  different  strains 
of  domestic  chickens.  The  strainswerea  broilerstra  in  from  the  United  Kingdom, 
a  layerstrain  from  Sweden  and  a  Silkie  strain  from  China.  To  make  the  map, 
researchers  identified  and  analyzed  about  2  million  genetic  variation  sites, 
mostly  SNPs.  The  genetic  variation  data  will  soon  be  deposited  into  GenBank, 
from  which  the  data  will  be  freely  accessible  to  researchers  worldwide. 


The  projectteam  hasproposed  to  conducta  pre-finishing  phase  in  2004 to 
c  lose  gaps,  extend  the  size  of  sequence  contigsand  enhance  overa  II  quality  of 
the  draft  sequence.  A  crude  analogy  would  be  if  the  sequence  is  viewed  asa 
book,  pre-finishing  takesthe  productfrom  an  unbound  collection  of  thousands 
of  loose-leaf  pages  to  a  n  ordered  a  nd  bound  ma  nuscript.  This  process  will 
provide  a  much  more  valuable  and  reliable  product  for  end  users  and  is  very 
cost  effective,  asthe  clone  libraries  remain  readily  available  forautomated  re¬ 
sorting.  Funding  forthe  pre-finishing  project  isunderconsideration  by  USDA. 


For  further  information  a  bout  the  chicken  genome  project  see  (http:// 
genome.wustl.edu/projects/chicken). 

Broad  Institute,  MIT  Well  Underway  in  Sequencing  the  Dog  Genome 

Dogsare  unique  among  mammalian  speciesin  the  extent  of  variation  they 
show  in  morphologica  I  tra  its  such  asheight,  weight,  mass,  shape,  and  behavior. 
Yet,  within  each  breed,  keytraitsare  inherited  within  extremely  narrow  limits.  No 
other  mammalian  species  presents  nature  I  variation  on  such  a  scale,  yet 
individuals  from  nearly  any  breedscan  be  mated  to  yield  fertile  offspring. 
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Given  the  aggressive  breeding  programsneeded  to  reproduc ibly  generate 
animals  of  distinctive  size,  shape  and  behavior,  it  is  not  surprising  that  purebred 
dog  fanciers  have  also  produced  closed  breeding  populations,  which  carry 
over400  inherited  disorders.  Genetic  diseasesare  predicted  to  occurwith  high 
frequency  in  populations  with  closed  gene  poolsand  in  which  breeding  of 
close  relatives  is  used  to  propagate  desired  traits.  Breeds  established  from  a 
small  numberof  foundersand  expanded  rapidly  to  meet  breeders'  and 
consumers'  demands  suffer  the  mostdue  to  a  combination  of  random  driftand 
inbreeding  depression.  Autosomal  recessive  and  complextraitspresentthe 
biggest  problem  asthe  status  of  asymptotic  ca  rriers  may  not  be  suspected  until 
several  litters  have  been  produced.  This  includes  diseases  such  ascancer,  heart 
disease,  deafness,  blindness,  motorneuron  disease,  skin  disorders,  and  a  host  of 
autoimmune  disorders,  each  of  which  hasbeen  difficult  to  study  in  humans. 

The  physiology,  disease  presentation  and  clinical  response  of  dogs  often  mimic 
huma  n  diseasesc  losely.  Thus,  positional  c  loning  of  genetic  diseases  in  the  dog 
can  inform  usaboutthe  genetic  basisorsusceptibility  to  similar  diseases  in 
humans. 


Because  of  the  potential  of  the  dog  asa  model  forstudying  human  disease, 
the  sequencing  of  the  dog  genome  wasapproved  by  NHGRI  in  2002  and  is 
now  nearing  completion  atthe  Broad  Institute,  MIT  A  female  Boxernamed 
Tasha  waschosen  forsequencing  because  among  the  breedsexamined  by  the 
Broad  investigatorsthe  Boxer  hasthe  lowest  a  mount  of  variation  in  itsgenome 
and  therefore  is  likely  to  give  an  easily  assembled  genome  sequence. 


The  sequencing  ofTasha  hasnow  been  completed  to  6.5-fold  coverage  of  the 
genome  and  sequence  assembly  isunderway.  The  sequence  tracesare 
available  at  http://www.ncbi.nih/Traces/trace.cgi?.  The  assembly  will  be 
posted  in  late  spring  of  2004and  will  be  aligned  against  the  human  sequence. 
Automated  annotation  of  the  genome  will  also  be  provided.  Tasha'ssequence 
will  then  be  compared  to  100,000  sequence  readsfrom  20  otherbreedsto 
generate  approximately  25,000  SNPsto  be  used  in  genetic  studies  of  the  dog. 


For  more  information  a  bout  the  dog  genome  project  see:  http:// 
www.broad.mit.edu/media/2003/pr_03_tasha  .html. 
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Unprecedented  Partnership  Launches  the  Bovine  Genome  Sequencing 
Project 

On  December  12,  2003,  Secretary  of  Agriculture  Ann  Veneman  and  Francis 
Collins,  Directorof  the  National  Human  Genome  Research  Institute  of  NIH,  jointly 
announced  the  launching  of  the  International  Bovine  Genome  Sequencing 
Project  (BGSP).  This  milestone  wasreached  aftera  monumental  effort  to  build 
an  international  consortium  to  fund  the  $53M  project. 

In  2002,  NHGRI  identified  the  cow  asa  high  priority  through  itsproposal  review 
processjwhite  paperproposal  can  be  viewed  at  http://www.genome.gov/ 
10002154).  Atthattime,  NHGRI  indicated  that  its  support  forthe  bovine  genome 
project wasdependent upon  raising  matching  funds.  Overthe  course  ofthe 
following  year  the  IWG,  with  the  assistance  ofthe  Alliance  forAnimal  Genomics 
Research,  worked  to  bring  togetheran  international  consortium  to  provide  the 
matching  funds  needed  forthe  project.  The  International  Bovine  Genome 
Sequencing  Consortium  (IBGSC)  wasformed  ofthe  funding  agenciesthatare 
contributing  fundsto  match  the  total  needed  forthe  project.  The  group 
consists  of  (with  funding  contribution):  National  Institutes  of  Health,  NHGRI 
($25M),  USDA  ($11M),  State  of Texas($10M),  Genome  Canada  ($5M),  Australia's 
CSIRO  ($1M),  Ag.  Research  New  Zealand  ($1M),  and  US  Cattlemens  Beef  Board, 
Texas,  and  South  Dakota  Beef  Councils  ($820K).  The  project  isunderway  atthe 
BaylorCollege  of  Medic ine's Huma n  Genome  Sequencing  Center,  Houston,  TX 
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(BCM-HGSC),  utilizing  the  international  bovine  BAC  map  asthe  scaffolding 
upon  which  to  do  the  sequencing  (see  previous  section  of  this  report). 

The  goals  of  the  project  are  to  produce  a  draft  sequence  coverage,  assembly 
and  primary  automated  annotation  of  the  bovine  genome,  including  SNP 
discovery,  full-length  cDNA  sequencing  and  finishing  of  a  selected  set  of  BACs 
The  genomic  sequencing  goal  is  6-fold  coverage  (WGSand  BAC  skims 
combined)  with  an  additional  1-fold  coverage  from  WGSSNP  discovery 
sequencing.  This  draft  coverage  will  be  augmented  by  finishing  of  75  Mb  of 
sequence  from  regions thatconta in  genes  influencing  important  traits.  The 
cDNA  sequencing  will  provide  nearly  10,000  sequences. 


The  goal  of  Phase  I  will  be  to  produce  and  assemble  a  draft  bovine  genome 
sequence,  inc  luding  a  SNP  discovery  component  and  production  of  full-length 
cDNA  sequences  The  BCM-HGSC  isthe  lead  centerin  thisphase  and  will 
coordinate  the  overall  conduct  of  the  project.  The  Genome  British  Columbia 
Sequencing  and  Mapping  Platform  (GBCSMP)  atthe  British  Columbia  Cancer 
Agency  (BCCA)  will  participate  in  thisphase  of  the  project  through  production 
of  full-length  bovine  cDNAs.  The  annotation  of  the  bovine  genome  sequence, 
although  notfunded  through  the  IBGSC,  isbeing  carried  out  by  the  European 
Bio  informa  tics  Institute  (EBI)  with  funding  from  the  Biotechnology  and  Biological 
Sciences  Research  Council  (UK)  in  collaboration  with  the  BCM-HGSC.  Phase  I  of 
the  project  is  targeted  for  completion  in  late  2005. 


Project  oversight  is  being  performed  by  a  Bovine  Genome  Sequencing  Project 
Advisory  Committee  (BGSPAC).  Thiscommittee  iscomprised  of  representatives 
from  each  contributing  funding  agency,  a  scientist  chosen  by  thatfunding 
agency  and  a  representative  from  each  sequencing  center.  Ex-officio 
members  include  a  representative  of  the  Alliance  for  Animal  Genomics 
Research,  and  a  representative  of  the  authorsofthe  "white  paper"  proposal. 
Steven  Kappes(MARC)  and  George  We  instock  (BCM-HGSC)  are  co-chairs  of 
thiscommittee. 


Asof  March  1,  2004,  a  total  of  2,500,000  sequence  readshad  been  deposited  in 
the  trace  archive  (amounting  to  ~0.5-fold  sequence  coverage).  The  sequence 
isavailable  at  http://www.ncbi.nig.gov/Traces/trace.cgi?.  Workon  the  BAC 
tiling  path  had  also  commenced  with  the  construction  of  shotgun  libraries. 
Effortsduring  the  first  three  quartersof  2004  are  being  focused  on  completion  of 
the  whole  genome  shotgun  portion  of  the  project.  Additionally,  planning  of  the 
SNP  discovery  and  full-length  cDNA  portionsof  the  effort  are  underway  by  the 
BG  SPAC . 
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Further  information  about  the  bovine  genome  project  isavailable  at:  http:// 
hgsc.bcm.tmc.edu/projects/bovine/. 

International  Swine  Genome  Sequencing  Consortium  Formed 

Recent  completion  of  the  human  genome  sequence  providesthe  starting  point 
for  understanding  the  genetic  complexity  of  humansand  how  genetic  variation 
contributesto  diverse  traitsand  disease.  Additional  species  must  be 
sequenced  to  resolve  the  genetic  complexity  of  huma  n  evolution  and  to 
effectively  use  genetic  information  in  veterina ry  a nd  human  medicine.  The  pig 
occupiesan  evolutionary  position  within  mamma  Is  distinct  from  primatesand 
rodents  but  close  to  the  bovine  and  dog.  Its  importa  nee  in  biomedical 
research  and  agriculture  hasled  to  the  development  of  genomic  resources 
that  will  be  extremely  important  to  the  research  community  studying  the  pig. 

The  pig  has  been  widely  used  as  a  biomedical  model  for  many  years,  a  nd  is 
expected  to  play  a  critical  role  in  research  directed  toward  understanding 
huma  n  obesity. 

Consumption  of  pork  continues  to  grow  worldwide  and  pork  represents  43%of 
all  red  meateaten  in  the  world.  Given  its  importa  nee  in  modem  animal 
agriculture,  obtaining  the  complete  genome  sequence  ofthe  pig  is  imperative. 
Such  scientific  information  will  lead  to  improved  performance  and  production, 
decreased  problemswith  porcine  diseasesand  reduced  environmental  and 
welfare  concerns. 


Leaders  from  the  pig  geneticscommunity  met  in  France  in  September  2003  a  nd 
established  the  International  Swine  Genome  Sequencing  Consortium  (ISGSC) 
with  representation  from  the  US,  UK,  France,  Denmark,  Korea,  Japan,  and 
China.  The  goals  ofthe  ISGSC  are  to  coordinate  the  scientific  and  funding 
activities  associated  with  sequencing  the  pig  genome.  The  scientific  strategy 
includes:  (1)  contig  sequencing  (~2X  coverage)  and  the  assembly  of  all 
sequencesata  majorgenome  sequencing  centerand  (2)  whole  genome 
shotgun  sequenc  ing  (~5X  coverage)  that  will  be  conducted  at  va  riouscenters 
around  the  world.  All  data  will  be  rapidly  placed  in  the  public  domain.  Larry 
Schookfrom  the  University  of  Illinois  is  serving  aschairof  the  consortium. 

Similarto  the  approach  being  utilized  in  the  bovine  genome  sequencing 
project,  the  internationa  I  swine  BAC  map  will  be  used  asthe  scaffolding  forthe 
sequencing  effort.  The  goal  ofthe  ISGSC  is  to  secure  the  necessary  funding 
required  (approximately  $40M)  forthe  project  by  October2004. 

Commitments  thusfarto  the  project  include  release  ofthe  ~.7-fold  whole 
genome  shotgun  sequence  from  the  Sino-Danish  pig  project  ($5M  in  kind),  a 
pledge  from  Korea  to  contribute  WGSat  NLRI  ($2M  in  kind),  and  release  of  BAC- 
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end  sequence  from  the  Sa ngerCenter  ($3M  in  kind),  Roslin  Institute  ($1M  in 
kind),  Iowa  State  University  ($0.3M),  and  the  University  of  Illinois ($1M  in  kind). 
Meetingshave  been  held  with  USDA,  the  U.S.  pork  industry  (Pork  Board  and 
National  Pork ProducersCouncil),  the  European  Commission,  Genome 
Canada,  the  Wellcome  Trust,  BBSRC,  DEFRA,  INRA  and  the  Danish,  Canadian, 
and  UK  pork  industries  to  explore  potential  direct  funding  ofthe  project. 


Genomics-Derived  Tools  Critical  in  US  BSE  (Mad  Cow 
Disease)  Crisis 

When  the  first  indexcase  of  bovine  spongiform  encephalopathy  (BSE),  more 
commonly  referred  to  as"mad  cow  disease,"  wasreported  in  Washington  state 
in  Decemberof  2003,  little  did  anyone  know  that  it  would  become  one  ofthe 
first  majorcasesforthe  application  of  genomicstechnology  to  providing  a 
valuable  and  rapid  solution  in  a  crisis  situation  in  animal  agriculture.  Following 
the  identification  ofthe  diagnosisof  thisanimal,  U.S.  beef  market  prices 
dropped  prec  ipitously,  due  to  pressure  imposed  by  the  closure  of  most  ofthe 
majorexport  markets  to  U.S.  beef. 


Shortly  after  the  identification  of  the  BSE  case,  recordsassocia  ted  with  the 
particularcow  in  question  indicated  that  she  wasoriginally  imported  from 
Alberta,  Canada,  where  a  similarcase  of  BSE  had  been  identified  several 
months  earlier.  However  the  recordsdid  not  unequivoc  a  lly  prove  her  origin. 
Fortunately,  the  records  identified  the  sire  and  other  relatives  of  the  BSE  index 
case;  furthermore,  their  tissues  were  available  forgenetic  analyses.  The 
availability  of  these  tissue  samples  permitted  rapid  pedigree  verification,  using 
genomic  technology,  and  thusconfirmed  the  records  indicating  the  Canadian 
origin  ofthe  BSE  indexcow. 

The  technology  used  in  thiscase  wasdeveloped  in  2002  by  ARS scientists  Mike 
Fleaton  and  coworkersatthe  USDA/ARSU.S.  Meat  Animal  Research  Center 
ClayCentet  Nebraska.  They  identified  32  selected  single  nucleotide 
polymorphisms (SNPs)  in  cattle  that  provided  very  accurate  identification  of 
individual  animals.  ThisSNP  battery  wasderived  from  hundreds  of  SNPs  and 
chosen  for  maximum  allelic  diversity.  Based  upon  results  from  multiple  breeds, 
thisgenotyping  system  iscapable  of  discriminating  between  any  two  random, 
unrelated  individuals  in  the  population  at  a  probability  level  of  2xl013  (i.e.  odds 
ratio  of  1:1  trillion). 

Using  thisgenotyping  test,  ARS  scientists  Will  La  eg  re  id,  Mike  Clawson,  and  Mike 
Heaton  were  able  to  provide  strong  scientific  evidence  thatthe  Washington 
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state  index  cow  was  one  of  a  group  of  81  females  that  had  c  rossed  the  U.S. 
borderfrom  Canada  several  yearsearlier.  Thisprovided  the  missing  link  in  the 
traceability  of  this  female  to  verify  that  she  was  bom  prior  to  the  1997 
implementation  of  a  full  ban  on  animal  protein  feeding  in  Canada. 

A  month  before  the  Washington  state  BSEcase  was  identified,  ARS  scientists 
reported  the  complete  prion  gene  sequence  variation  in  U.S.  cattle.  A  quick 
comparison  of  results  showed  that  the  BSE  index  casesdid  not  have  novel  prion 
gene  sequences.  This  was  important  because  DNA  sequence  analysisof  both 
the  May  20th,  2003,  Canadian  BSEcase  and  the  Washington  state  case  had 
revealed  apparently  novel,  but  different,  mutations  in  their  prion  gene 
sequences.  This  would  have  otherwise  suggested  the  occurrence  of  "sporadic 
BSE,"  i.e.  not  related  to  contaminated  feed.  Together,  the  new  bovine  genomic 
tools  had  a  n  immediate  and  positive  impact  on  the  beef  industry,  DNA 
traceback,  and  animal  disease  control. 


V.  Plans  for  FY  2004 


Beyond  Sequencing-Focusing  on  Bioinformatics  and  Functional 
Genomics  Tool  Development  for  Domestic  Animal  Species 

With  the  esta  blishment  of  a  genomics  infrastructure  fora  numberofthe 
domestic  animal  species,  the  Interagency  Working  Group  is  now  turning  its 
attention  to  itsgoalsin  the  areasof  bio  informa  tic  sand  functional  genomics.  To 
reiterate,  those  specific  goals  are: 

•  Data  management  and  bio  informa  tics  to  specifically  supportagriculturally 
important  species.  This  will  support  significant  improvements  in  data 
managementand  ana  lysis  software,  allow  greaterdata  accessibility  and 
secure  long  term  maintenance,  increase  capabilities  to  deal  with  rapidly 
accumulating  data  complexity  asdatabasesinclude  functional  information, 
a  nd  provide  more  powerful  tools  to  mine  large  genomes  (individually  and 
comparatively). 

•  Functional  genomicsto  specifically  investigate  agriculturally  important 
species.  An  increase  in  data  forlivestockgenomesrequiresconcomitant 
investment  in  functional  genomicsto  support  genome  annotation,  the  study 
of  gene  regulation  and  expression,  and  speciesevolutionary  relationships. 
Researchers  will  have  access  to  individual  genes,  which  can  be  cloned  and 
characterized,  the  ability  to  scan  entire  genomes  for  specific  functions,  and 
be  able  to  gain  new  insights  from  comparisonsto  othergenomes. 
Quantitative  trait  loci  will  be  accessible  on  a  gene  level,  leading  to  studies 
thatwill  elucidate  characteristics  such  asfood  productivity  and  disease 
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resistance  and  will  lead  to  the  ability  to  manipulate  those  traits  more  quickly 
and  effectively. 

USDA  is  planning  a  workshop  be  held  in  Washington,  DC  on  September  22-23, 
2004  to  specifically  develop  a  road  map  foraddressing  the  infrastructure  needs 
in  domestic  animal  bioinfomaticsand  functional  genomics  Thisworkshop  will 
bring  together  leaders  from  the  domestic  animal  genomics  research 
community  with  leading  authorities  from  the  human  and  model  species 
communities  to  flesh  outthisplan.  The  resulting  white  paperreport  will  be 
presented  to  the  IWG  for  consideration  in  carrying  out  itsmission  to  move  efforts 
forward  in  these  areas. 


Development  of  Broad-Based  Functional  Genomics  Programs  in 
Domestic  Animal  Infectious  Disease 


One  of  the  most  important  potential  applications  of  domestic  animal  genomics 
is  in  the  area  of  infectiousanimal  disease.  Recent  attention  on  disease 
outbreaks  with  potential  human  impact,  such  asthe  avian  flu  outbreak  in  Asia, 
North  America,  and  Europe,  and  prion  diseases  such  asBSE  in  cattle,  scrapie  in 
sheep  and  chronic  wasting  disease  in  elkand  deer  have  accentuated  the 
need  for  better  unde  rsta  nding  of  the  genomes  of  both  host  and  pathogen. 
Genomicsoffersthe  opportunity  to  better  characterize  genetic  susceptibility  to 
threatening  diseases,  particularly  those  transmitted  between  humansand 
animals  With  the  completion  of  annotated  genome  sequencesforthe  major 
domestic  animal  species,  the  opportunities  to  pursue  these  genomics 
approaches  a  re  at  hand. 

In  the  coming  year  the  question  of  what  will  be  required  forthe  publicly  funded 
research  community  to  develop  broad -based  domestic  a  nimal  genomics 
research  prog  rams  to  ad  dress  these  critically  important  animal  disease  a  nd 
biosecurity  issues  needsto  be  addressed.  This  is  related  to  the  Presidential 
Directive  HSPD-9  which  calls  fora  national  plan  forprotection  of  agricultural 
interests  within  Homeland  Security. 


Expanding  the  Genomics  Infrastructure  to  Include  Additional  Species 


Expanding  the  genomics  infrastructure  beyond  the  majordomestic  animal 
species  foe  used  upon  in  the  IWG'sinitial  efforts  (chic  ken,  cow,  pig,  dog, 
honeybee)  isa  majorgoal  in  the  coming  year.  Effortsare  underway  in  a 
numberof  other  species  to  develop  community  resourcesin  the  public  domain 
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including  BAC  mapsand  ESTcollectionsaswell  asdraft  genome  sequence 
assemblies. 


The  Department  of  Energy'sJ  oint  Genome  Institute  at  Walnut  Creek,  California 
recently  announced  that  it  will  accept  white  pa  per  proposals  for  whole 
genome  shotgun  sequencing  through  theirCommunity  Sequencing  Program. 
A  number  of  species  of  interest  to  the  IWG  were  submitted  including  the 
channel  catfish,  rainbow  trout,  and  Pacific  oyster. 

While  nota  domestic  animal,  a  mammal  of  considerable  interest  to  marine 
mammologists  is  the  dolphin.  Efforts  in  the  coming  year  by  the  Office  of  Naval 
Research  are  seeking  to  launch  a  dolphin  genomicseffort. 

The  sheep  genomicscommunity  hasrecently  launched  an  effort  to  build  a 
comprehensive  BAC  map  of  the  sheep  genome,  working  with  the  Children's 
Hospital  of  Oakland,  Oakla  nd,  California .  The  domestic  cat  a  Iso  continues  to 
be  rated  ata  "Moderate"  priority  for  sequencing  by  NHGRI. 


Does  Genomics  Technology  Offer  the  Solution  for  Animal  Traceability  and 
a  National  Animal  ID  System? 


Asdescribed  earlierin  this  report,  DNA  genotyping  wasused  in  the  recent  U.S. 
BSE  crisis  through  its  use  to  verify  the  Canadian  origin  of  the  Washington  state 
BSE  indexcow.  Almost  immediately  following  the  discovery  of  thiscase,  the 
Secretary  of  Agriculture  directed  the  establishment  of  a  nationa  I  a  nimal  ID 
system  to  allow  full  traceability  of  animals  in  such  biosecurity  events  in  the 
future.  Planning  fora  national  animal  ID  system  to  allow  traceability  was 
already  well  underway,  however. 


In  Octoberof  2003,  the  U.S.  Animal  Identification  Plan  (USAIP),  developed  under 
the  fac  ilitation  of  USDA's Animal  a  nd  Pla  nt  Health  Inspection  Service,  was 
endorsed  by  the  U.S.  Animal  Health  Association.  While  the  USAIP  laysoutthe 
basic  plan  fora  national  animal  identification  system,  with  the  goal  of  allowing 
traceability  of  a  n  animal  within  48  hours  of  a  biosecurity  breach,  it  does  not 
stipulate  the  platform  for  the  system.  DNA  genotyping  has  high  potential  for 
providing  such  a  platform. 


By  increasing  the  genotyping  system  used  by  USDA  to  verify  the  parentage  of 
the  BSE  index  cow  in  2003  from  32  to  45  SNPs,  the  discrimination  power  can  be 
ra  ised  to  a  level  that  a  Hows  the  unique  identification  of  any  a  nimal  drawn 
randomly  from  the  cattle  population,  although  allelic  frequencies  of  additional 
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SNPs must  be  validated  acrossadditional  industry  breed  populations.  Currently, 
an  industry  collaboratorhasmoved  the  SNP  battery  to  include  47  SNPs(38 
highly  informative  SNPs,  7  Brahman-specific  (i.e.  Indie  us)  SNPsand  two  sex¬ 
determining  SNPs). 

The  application  of  genomic  technology  to  livestock  improvement  and 
biosecurity  isbeginning  to  occur.  Several  DNA  diagnostic  testsare  now 
commercially  available  in  the  public  domain  with  a  number  under  industry 
developmental  testing.  An  even  greaternumberof  markersare  routinely  being 
used  within  private  industry,  particularly  in  the  case  of  swine.  The  numberof 
DNA  diagnostics  is  expected  to  increase  exponentially  with  the  release  of 
annotated  whole  genome  sequence  data  forchicken,  cattle,  and  swine  within 
the  next  few  years,  the  increase  in  technologies  allowing  clearer  ana  lysis  of 
gene  function  and  regulation  (microarrays,  serial  ana  lysis  of  gene  expression, 
RNAi,  etc),  and  the  integration  of  genetic  and  physical  mapsforthese  species 
(integration  of  linkage,  radiation  hybrid,  and  BAC  maps).  Such  DNA  diagnostics 
will  allow  the  determination  of  DNA  genotypesof  individuals forboth  genetic 
improvement  and  precision  managementapplications,  including  prevention  of 
risk  of  disease  outbreak. 


The  national  animal  ID  system  could  be  the  vehicle  to  allow  the  widespread  use 
and  adoption  of  DNA-based  management  systems.  The  determination  of  a 
unique  animal  ID,  based  upon  the  animal'sDNA  profile  fora  pre-determined 
SNP  battery,  could  serve  two  purposes.  First,  unique  identification  of  an  animal 
is  easily  accomplished.  Secondly,  the  SNP  battery  can  be  designed  to 
simultaneously  genotype  economically  important  diagnosticsforgenetic  and 
ma  nagement  purposesforthose  who  wish  to  have  access  to  such  information. 


Previous  use  of  DNA  genotyping  foranimal  identification  hasbeen  almost 
exclusively  forparentage  verification.  Widespread  implementation  of  the 
technology  has  been  very  slow,  primarily  due  to  the  high  cost  ($25  per  sample), 
in  using  microsatellite-based  genotyping  systems.  Economic  modeling  in  beef 
cattle  systems  has  shown  that  the  cost  of  this  techno  logy  must  be  in  the  $5  to  $7 
perhead  range  to  be  practical.  Highervolume  would  be  expected  to 
substantially  reduce  perhead  cost. 

Current  genotyping  platforms  have  achieved  remarkable  throughput  and  costs 
have  lowered  dramatically  in  recent  years.  Current  estimates  range  from  $.25 
to  $.50perSNP  in  high  volume  operations.  Using  a  mas  spectrometry  platform, 
most  companies  estimate  that  the  lowerend  of  thiscost  range  isfeasible.  Thus, 
a  DNA  genotyping  system  employing  40  markers  could  be  commercially 


25 


COORDINATION  OF  PROG  RAMS  ON  DOMESTIC  ANIMALGENOMICS;  THE  FEDERAL  FRAMEWORK 


available  for$10  orlessin  high  volume  systems,  aswould  be  the  case  in  a 
national  animal  ID  scheme.  Several  commercial  vendorsare  attempting  to 
enter  va  rious  pa  rts  of  this  type  of  ma  rket. 

The  practical  application  of  a  DNA-based  system  would  operate  in  the 
following  manner: 

•  An  animal  would  be  sampled  forDNA  simultaneously  when  a  unique  bar 
code  isassigned  to  it  as  its  individual  animal  ID  (USAIN  in  the  USAIP).  Tagging 
systems  a  llo  wing  such  to  occurusing  electronic  ID  (EID)  are  available  in  the 
marketplace  which  allow  retrieval  of  DNA  from  blood  spotted  to  a  card  at 
the  time  of  tagging,  already  cross-referenced  to  the  barcode  of  the  tag. 

•  The  DNA  genotype  forthe  animal  would  be  determined  and  entered  in  the 
national  database  by  cross-reference  to  the  barcode. 

•  If  the  animal  wasinvolved  in  a  situation  requiring  trace-backand  verification, 
DNA  would  alwaysbe  available,  through  to  the  retail  cut  basis,  to  do  so. 

•  If  external  identification  fora  n  anima  I  was  lost,  the  a  nimal  could  be  easily  re- 
genotyped  to  determine  its  original  USAIN. 

•  If  the  DNA  genotype  information  wasdesired  by  the  ownerof  the  animal,  the 
commercial  vendorcould  make  such  available  on  a  fee  basis,  allowing  the 
offsetting  of  a  portion  of  the  genotyping  cost  of  the  national  animal  ID 
system. 

The  application  of  genomics-based  technology  in  a  national  animal 
identification  system  will  be  carefully  evaluated  within  federal  research 
programs  in  the  coming  year. 
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VI.  Appendix 


URLsforuseful  information  on  domestic  animal  genomics: 

National  Science  and  Technology  Council  homepage: 

http :// www.ostp.gov/NSTC/html/ NSTC_Home.htm  I 

USDA  Research,  Education,  and  Economics  homepage: 

http  ://www.reeusda  .gov/ree/ 

NSF  Directorate  for  Biological  Sciences  homepage: 

http :// www. nsf.gov/ bio 

DOE  Office  of  Science  homepage:  http://www.sc.doe.gov/ 

National  Human  Genome  Research  Institute  homepage: 

http :// www. nhgri.nih.gov/ 

DbEST(ESTdatabase  at  the  NCBI):  http://www.ncbi.nlm.nih.gov/dbEST/ 

Online  Mendelian  Inheritance  in  Man  Database  at  NCBI: 
http  ://www.ncbi.nlm.nih.gov/Omim 
GenbankatNCBI:  http://ww.ncbi.nih.gov/Genba  nk 
National  Centerfor  Biotechnology  Information’s  Map  Viewer 
http :// ww.ncbi.nlm.nih.gov/mapview 

European  Molecular  Biology  Laboratory's  Nucleotide  Sequence  Database,  EMBL-Bank: 

http://ww.ebi.ac.uk/embl/index.html 

DNA  Data  Bankofjapan,  DDBj :  http://ww.ddbj.nig.ac.jp 

Lfoiversity  of  California,  Santa  Cruz'sGenome  Browser 

http  ://ge  nome.ucsc.edu/cgi-bin/hg  Gateway 

European  Bioinformatics  Institute's  Ensembl:  http://ww.ensembl.org 

Vtbshington  University  Genome  Sequencing  Center 

http  ://ge  nome.wustl.edu/  projects 

Broad  Institute,  MU:  http://www.broad.mit.edu/ 

BaylorCollege  of  Medicine  Human  Genome  Sequencing  Center 

http ://  hgsc  .be  m.tmc  .ed  u 

ChickNet  http://chicken-genome.org 

Alliance  for  Animal  Genomics  Research: 

http  ://eve  rsoleassociates.com/alliance 

Centerfor  Integrated  Animal  Genomics,  Iowa  State  University: 

http  ://c  iag.iastate.edu 

Biotech  Center,  University  of  Illinois:  http://www.biotec.uiuc.edu 
USDA  Agricultural  Research  Service:  http://www.ars.usda  .gov 
USDA/ARSU.&  Meat  Animal  Research  Center 

http  ://www.sol.marc.usda  .gov/ 

USDA/ ARS  Beltsville  Animal  and  Natural  Resources  Institute: 

http :// Ip  si. bare. usd  a. gov/ 
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